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Abstract 

Reinforced concrete is subjected to deterioration due to aging, increased load, and natural hazards. To minimize the 
maintenance costs and to increase the operation lifetime, researchers and practitioners are increasingly interested in 
improving current nondestructive evaluation technologies or building advanced structural health monitoring strategies. 
Acoustic emission methods offer an attractive solution for nondestructive evaluation/structural health monitoring of 
reinforced concrete structures. In particular, monitoring the development of cracks is of large interest because their 
properties reflect not only the condition of concrete as material but also the condition of the entire system at structural 
level. This article presents a new probabilistic approach based on Gaussian mixture modeling of acoustic emission to 
classify crack modes in reinforced concrete structures. Experimental results obtained in a full-scale reinforced concrete 
shear wall subjected to reversed cyclic loading are used to demonstrate and validate the proposed approach. 

Keywords 
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Introduction 

Over the past century, reinforced concrete (RC) has 
been widely used in civil structures such as buildings, 
bridges, nuclear power plants, or dams whose perfor- 
mance and function are vital to the security of our soci- 
ety. Unfortunately, many of these structures are facing 
an increasing number of challenges such as aging or 
natural events (i.e. earthquakes, hurricanes, and tsu- 
nami) that can jeopardize their safety and serviceability. 
To secure the overall soundness of these structures, 
proper assessment is crucial. In particular, cracks are of 
large interest in engineering practice (Jahanshahi et al., 
2013), since their properties reflect not only the condi- 
tion of concrete as a material but also the condition of 
the entire system at the structural level. Methodologies 
based on acoustic emission (AE) offer an attractive 
solution for monitoring the nucleation and growth of 
cracks in RC structures. AEs are stress waves caused by 
sudden strain releases, such as internal fractures or con- 
crete cracking (Grosse and Ohtsu, 2008). In general, a 
sparse array of piezoelectric transducers is placed in 
contact with the structure to detect these waves while 
some features are extracted from the detected AE 



signals to assess its structural integrity. Extracting fea- 
tures from AE signals is usually referred to as 
parameter-based technique. A threshold is used as a 
reference for the AE features. In general, this threshold 
is set somewhat above the background noise. Some of 
the most common features include peak amplitude, rise 
time, duration, and count (see Figure 1). The AE peak 
amplitude is defined as the maximum amplitude of the 
signal. The number of times the signal rises and crosses 
the threshold is the count of the AE event. The time 
period between the rising edge of the first count and the 
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falling edge of the last count is the duration of the AE 
event. Finally, the time period between the rising edge 
of the first count and the peak of the AE event is 

|AQ3| defined to be the rise time (Miller et al., 2005). The past 
few years have seen a surge of research aimed at deter- 
mining functional or correlation interdependencies 
between crack characteristics and AE features 
(Carpinteri et al., 2007; Farhidzadeh et al., 2012a; 
Ohtsu, 2010; Salamone et al., 2011, 2012; 
Vanniamparambil et al., 2012), including />-value analy- 
sis (Farhidzadeh et al., 2012b, 2013; Kurz et al., 2006; 
Shiotani et al., 2001), source localization (Dehghan- 
Niri et al., 2013; Grosse et al., 1997), and AE energy 

|AQ4| (Benavent-Climent et al., 201 1; Colombo et al., 2005). 
In this article, a new approach based on a Gaussian 
mixture modeling (GMM) is presented to perform 
crack mode classification in RC structures. GMM is an 
unsupervised distribution-based classification tech- 
nique (Bilmes, 1998) that has been successfully used in 
many fields, including sound recognition (Reynolds 
et al., 2000; Reynolds and Rose, 1995), image process- 
ing (Permuter et al., 2006), dynamical systems and 
tracking (Povinelli et al., 2004; Stauffer and Grimson, 
1999; Terejanu et al., 2008), and text recognition 
(Richiardi and Drygajlo, 2003); however, the use of this 
technique for AE-based structural health monitoring 
(SHM) has not been investigated. 

This article is organized as follows: in the next sec- 
tion, we briefly discuss the crack classification method 
developed by the Japan Construction and Material 
Standard (JCMS) (JCMS-IIIB5706, 2003) for RC 
structures, and its shortcomings are highlighted. The 
theoretical aspects of the GMM algorithm are then 
described. The following sections describe the experi- 
mental study used to validate the proposed algorithm 
along with the results. At the end, conclusions are 
summarized. 



Conventional crack mode classification 

In the last years, several efforts have been made to cor- 
relate AE features with the cracking mode in RC mem- 
bers (Aggelis, 2011; Beck et al., 2003; ElBatanouny 
et al., 2012; Farhidzadeh et al., 2012a; Ohno and 
Ohtsu, 2010; Shigeishi, 2001). Cracks can adversely 
affect structural performance in various ways such as 
durability and serviceability. In general, in loading such 
structures until failure, tensile cracks as a result of pure 
tensile stress develop at the initial stages of loading, 
while shear cracks dominate later (Yuyama et al., 
1999). Therefore, it may be beneficial to monitor the 
mode of cracks, as it may lead to a better prediction of 
the structural performance and eventually provide an 
early warning for the planning and implementation of 
remedial actions to the structure at a point where it is 
less expensive and invasive than when its structural 
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Figure I . AE parameters in an AE signal. 
AE: acoustic emission. 
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Figure 2. Conventional crack classification in JCMS-IIIB5706 
code. 



performance has been seriously compromised. A meth- 
odology for monitoring the crack propagation in RC 
structures has been proposed by the JCMS. This meth- 
odology is based on two AE parameters, namely, 
"average frequency (AF)" and "RA" value, which are 
defined as (see Figure 1) 

RA = (Rise time)/(Peak amplitude) (1) 

AF = (Counts)/Duration (2) 

According to the JCMS, AE sources can be classi- 
fied into tensile cracks and shear cracks, based on the 
relationship between RA values and average frequen- 
cies, as shown in Figure 2. However, a defined criterion 
on the proportion of these two parameters has not been 
confirmed yet (Ohno and Ohtsu, 2010). In fact, AE 
measurements are mostly random data and, in general, 
nonlinearly separable. Therefore, the distribution char- 
acteristics of the data set should be considered for 
developing a more robust classification algorithm. In 
this article, a new probabilistic approach based on a 
GMM is proposed to take into account data distribu- 
tion properties and classify AE sources into two domi- 
nant clusters, namely, shear and tensile. 
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Gaussian mixture modeling 

A GMM is a parametric probability density function 
represented as a weighted sum of M component 
Gaussian densities. For a Z>-dimensional measurement, 
training, or feature vector x, the mixture density is 
defined as (Reynolds and Rose, 1995) 



M 



p(x\\) = ^(OtNjQclJi,, 2 ) i = 1, 



,M 



(3) 



i = 1 



where co j are the mixture weights and 7V,-(x|/2-, £. ) are 
the unimodal component Gaussian (Normal) densities. 
Each component density is a Z>-variate Gaussian func- 
tion of the form: 



exp 



(2-f /2 |E,| 1/2 



(4) 



with D X 1 mean vector /I, and DX D covariance 
matrix £.. The mixture weights satisfy the constraint 



that X)/=i w i = l- The complete Gaussian mixture 
model is parameterized by the mean vectors, covariance 
matrices, and mixture weights from all component den- 
sities. These parameters are together represented as 



A = {«,-,/Z,-,^\}, i = 1, ... ,M 



(5) 



For a GMM-based classification system, the goal of 
the model training is to estimate the parameters of the 
GMM, A, so that the Gaussian mixture density can best 
match the distribution of the feature vector x. The next 
step is to find the best estimate for A. 

Maximum likelihood (ML) estimation is one of the 
most popular and well-established methods available 
for estimating co h jX t , and £,-. The aim of ML estima- 
tion is to find the model parameters, which maximize 
the likelihood of the GMM, given the training 
data. For a sequence of T training vectors 
X= {xi, ..., x T }, the GMM likelihood, assuming 
independence between the vectors, can be written as 



/>(*|A)= l[p(xt\X) 



(6) 
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Due to nonlinearity of this expression as a function 
of A, direct maximization (i.e. setting first derivative 
equal to zero and constraining the second derivative to 
be positive) is not computationally tractable. Instead 
ML parameters can be obtained iteratively by the 
expectation maximization (EM) algorithm (Dempster 
et al., 1977). 

The EM algorithm is an iterative procedure that 
maximizes the likelihood generated by each GMM 
(Hastie et al., 2009). The basic idea of the EM 



algorithm is beginning with an initial model A*, then 
estimate a new model X k+ ', such that p{X\k k + x )> 
p(X\X k ) (Reynolds and Rose, 1995). The new model 
then becomes the initial model for the next iteration, 
and the process is repeated until some convergence 
threshold is reached (i.e. 10~ 6 for log-likelihood). This 
algorithm consists of two steps, expectation and maxi- 
mization, which guarantee a monotonic increase in the 
likelihood value of the model (Dempster et al., 1977). 
The expectation step results in the a posteriori prob- 
ability for component ;', which is defined as the prob- 
ability that the state is i and that the mth Gaussian 
mixture accounts for the observation %, given the &th 
re-estimated model A /; 



7=1 



(7) 



The maximization step returns the distribution para- 
meter with these components (Reynolds and Rose, 
1995) 



f-i 



f +1 = ^Pr(/|f„A*) 



£ Pr(/|x„A A )x, 

k + 1 _ V = 1 



£Pr(*ft,A*) 

t= l 



(8) 



(9) 



* + i 

E 



1=1 



£ Pr (/|x,, A*)X& - 0* + l )(x, - iL\ + l f 



£Pr(/|x„A*) 

t= l 



(10) 



The use of a GMM may be motivated by the intui- 
tive notion that the individual component densities 
may model some underlying set of hidden classes 
(Rabiner and Juang, 1986; Rammohan and Taha, 
2005; Reynolds and Rose, 1995). This possibility moti- 
vates application of GMM for crack mode classifica- 
tion in concrete structures with two hidden classes, that 
is, shear and tensile crack (i.e. M — 2) in a two- 
dimensional measurement vector, x = (RA,AF). 

To implement a system for crack mode classification 
in concrete structures, the feature (or measurement) is a 
2D vector (i.e. x = (RA,AF)), the sequence of T training 
vectors is X = {x\ = (RA\,AFi), . . . , x, = (RA,,AF,), 
...,xt~ (RA t ,AF t )}, and the latent or hidden classes 
are / = {1,2} for tensile and shear mode, respectively. 
We aim at estimating the parameters of the GMM 
(weight, mean, and covariance matrix for each hidden 
class), which best matches the distribution of the 
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Figure 3. Experimental test: (a) test setup view, AE sensor layout (SI-S8), and dimensions; (b) load protocol. 
AE: acoustic emission. 



training feature vectors. The overall procedure can be 
summarized as 



1. 



2. 
3. 



4. 



Initializing the parameters in A . The initial guess 
of the parameters of the state-dependent 
Gaussian mixtures is obtained using vector 
quantization with two codes (Zhou et al., 2009). 
Applying equation (7) to get Pr (i\x,,X k ). 
Using Pr(/|x,,A*) to compute a better estimate 
for the parameter A + ' (i.e. applying equations 
(8) to (10)). 
Iterating steps 2 and 3 until convergence. 



Experimental study 

The test specimen was a large-scale rectangular RC 
shear wall with a height-to-width ratio of 0.94, designed 
based on ACI3 18-08, Chapter 21, earthquake-resistance 
structures (ACI Committee 318, 2008). The width, 
thickness, and effective height (from foundation to the 
centerline of loading) were 305 (120 in), 20.3 (8 in), and 
287 cm (112.8 in), respectively (Luna, 2013; Rocks, 
2012). The horizontal and vertical reinforcement ratios 
were 0.67%. Compressive strength of concrete on the 
day of test was 24.8 MPa, and the yield and ultimate 
strength of the reinforcing bars were 464 and 708 MPa, 
respectively (Rocks, 2012). The main components of 
the AE system included an eight-channel high-speed 
data acquisition board (Physical Acoustics Corporation 
Micro-II PAC) and a dedicated software for signal pro- 
cessing and storage (AEwin). The test specimen was 
instrumented with eight R15a sensors with resonance 
frequency of 1 50 kHz and operating frequency range of 
50^400 kHz. They were attached to one face of the wall 
using hot glue. Preamplifiers were set at 40 dB gain, 



analog bandpass filters were adjusted in the interval of 
20^100 kHz, and the threshold level was set to 35 dB. 
The experimental setup and load protocol are shown in 
Figure 3. The lateral load was applied to the specimen 
by two horizontally inclined high force capacity actua- 
tors. The specimen was subjected to a displacement 
controlled reversed cyclic loading with a loading rate of 
0.6 mm/s. The loading protocol consisted of 11 load 
steps (LSs). In particular, the first load step (LS0) had 
three cycles, whereas the subsequent load steps had two 
cycles. LS0 was carried out to verify the functionality of 
the experimental setup. The crack classification analysis 
was carried out using data recorded between LS2 and 
LS9. In fact, a very low AE activity was recorded prior 
to LS2, whereas in LS10 the specimen was severely 
damaged resulting in a significant amount of AE activ- 
ity mostly due to concrete spalling, wall-foundation 
sliding, and rebar yielding. The force-displacement hys- 
teresis loops and the corresponding backbone curves 
are illustrated in Figure 4. An initial lateral stiffness 
reduction of the specimen, caused by the nucleation of 
microcracks and hairline cracks (Farhidzadeh et al., 
2012a, Rocks, 2012), was observed after LS2. In a pre- 
vious work, authors demonstrated, using Sifted />-value 
(Sb) analysis (Farhidzadeh et al., 2012a, Farhidzadeh 
and Salamone, 2012), that a microcrack-macrocrack 
transition occurred in LS3. Then the wall behaved line- 
arly up to LS6. Significant nonlinear response occurred 
in LS7, and finally, the wall reached its ultimate 
strength at LS9. Figure 5 exhibits cracking at peak 
deformation of the first cycle in load steps 2, 5, and 9 to 
manifest the development of orientation and positions 
of cracks. The cracking patterns at the negative load 
peaks were consistent with these pictures due to the 
symmetric behavior of the wall. Tensile (horizontal) 
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Figure 4. Force-displacement hysteresis loops and backbone 



and shear cracks (diagonal) are mapped on the photo- 
graphy by solid and dash lines, respectively. In the ini- 
tial load steps, the cracks started from tensile mode and 
then they were followed by diagonal shear cracks in 
the intermediate load steps. The diagonal cracks 
became dominant at the final load steps (see LS9 in 
Figure 5). In addition, maximum and modal (i.e. 
appearing most frequently) crack widths are marked 
in each picture with a square dot (□) and with a circle 
(o), respectively. 



Experimental results 

In this section, the results of the crack classification 
using the proposed GMM algorithm are presented. For 
each load step, the two features defined in equations 
(1) and (2) (i.e. RA and AF) were extracted from each 
AE signal detected. Then, a moving average with span 
and lag of 70 hits was carried out to reduce scatter 
(Aggelis et al., 2011; Ohtsu, 2010; Soulioti et al., 2009). 



As a result, the data set X, consisting of data points 
Xj = (RAj,AFi), was generated, that is 

X = {(RA U AF 1 ),(RA 2 ,AF 2 ), ...,(RA T ,AF T ), x t eR 2 }. 

In order to apply the GMM algorithm, the number 
of classes should be defined. The sufficiency of two 
dominant clusters selected in this work can be investi- 
gated using the Bayesian information criterion (BIC), 
which is 



BIC 



2 In (L) + kX\n(T) 



(11) 



where L is the maximized value of the likelihood func- 
tion for the estimated model, T is the number of obser- 
vations, and k is the number of free parameters to be 
estimated for each GMM. The BIC criterion provides a 
tradeoff between model accuracy and complexity of the 
model. It should be noticed that ideally the data set can 
be fitted more accurately by increasing the number of 
mixture components. However, this leads to the estima- 
tion of more number of parameters and hence increases 
model complexity. Furthermore, this can also lead to 
overfitting the data and experiment noise. BIC can help 
to select the optimum number of clusters required while 
maintaining a balance between accuracy and model 
complexity. Figure 6 illustrates the variation of BIC by 
additional number of clusters for each load step. It can 
be observed that from one cluster to two clusters, the 
BIC decreases significantly and follow a fairly constant 
trend by including additional clusters. Moreover, in 
order to explore the efficacy of adding a cluster with a 
better resolution, the BIC change ratio {t- M + j) is calcu- 
lated according to the following equation when M com- 
ponent Gaussian densities increase to M + 1 



& 



BICm — BICm + i 
BICm 



(12) 



where BIC M is the BIC with M mixture components. 
Figure 7 depicts the variation of £ versus additional 




Figure 5. Cracks position, orientation, and mode in the positive peak of first cycles of load steps: (a) LS2, (b) LSS and (c) LS9. 
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Figure 6. BIC for each load step. 
BIC: Bayesian information criterion. 



mixture component. The BIC reduces from one cluster 
to two clusters ranging from 4% to 11% depending on 
the load step with an average of 6%. However, this 
reduction is not significant in any load step by adding 
another mixture component because the improvement 
is less than 3%. Therefore, considering the computation 
expenses, two clusters can efficiently fit the data distri- 
bution. This outcome is consistent with other studies in 
which the selection of two clusters has been suggested 
(Aggelis, 2011; JCMS-IIIB5706, 2003; Ohtsu, 2010; 
Ohno and Ohtsu, 2010). 

To compare the outcome of GMM with conven- 
tional JCMS approach, Figure 8 illustrates the feature 
vector X recorded during the LS2 as an example. As is 
said earlier, the conventional classification scheme 
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standard deviation. 
BIC: Bayesian information criterion. 



(Figure 8(a)) does not provide a well-established criter- 
ion to define the separator line, here indicated by 
dashed lines; therefore, this method cannot provide 
conclusive results. On the other hand, Figure 8(b) dis- 
plays the contour plot of the Gaussian mixture density 
function. The following observations can be made from 
this figure: (a) despite Figure 2 that implies two 
mutually exclusive categories of shear and tensile, it 
can be observed that these classes are intersected and 
have a mixed Gaussian density function (with a larger 
weight for the tensile class in this load step) and (b) 
although a straight line may not be an appropriate 
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boundary for classification, the GMM can smooth the 
path toward identifying the optimum discriminator 
line, which is simply the segment bisector of a virtual 
line connecting the means of the two classes (dash-dot 
line). In this article, however, a new criterion will be 
later proposed instead of a straight line to discriminate 
the two classes. 

To illustrate the effect of the window span size on 
the GMM results, Figure 9 shows the feature vector X 
in LS2 obtained using different window span sizes (i.e. 
n = 30, 50, 70, 100) and its corresponding GMM results. 
It can be observed that, although the span of the mov- 
ing window reduces the number of data (Figure 9(a), 
(c), (e), and (g)), the shape of the GMM results and the 
estimated mixture weights u> j are not significantly 
affected. 

To monitor the overall behavior of the wall during 
the experiment, the results of the GMM for the load 
steps 2, 3, 5, 6, 7, and 9 are illustrated in Figure 10. 
These results were obtained using data from all eight 
sensors. The red regions represent higher probability 
density all the way through the blue regions, which 
depict smaller densities. The tensile and shear classes 
are encircled by dashed ellipses representing the confi- 
dence interval of 0.9. By definition, confidence interval 
gives an estimated range of values, which is likely to 
include an unknown population parameter (Easton and 
McColl, 1997) and vary between and 1. Three stages 
can be identified in Figure 10: (a) the dominance of ten- 
sile cracks during the initial load steps (i.e. LS2 and 
LS3). In these load steps, the concentration of feature 
vectors is around the mean of the tensile class, (b) A 
transition stage from tensile to shear in intermediate 
load steps (LS5 and LS6). In this stage, the high prob- 
able area is gradually moving toward the mean of the 
shear class, (c) Dominance of shear cracks at the final 
load steps (LS7 and LS9). In this stage, the most likely 
area is concentrated around the mean of the shear class. 

Another equally interesting observation is that dur- 
ing the initial loading steps (i.e. LS2 and LS3), the RA 
components in the shear class are widely distributed 
between and 60 |xs/dB. This is likely due to the fact 
that microcracks that are generally nucleated during 
small levels of loads generate AE signals with small 
amplitudes; this results in larger RA values (see equa- 
tion (1)). On the other hand, during intermediate and 
larger loadings (i.e. LS4 onward), the signals are gener- 
ally of higher amplitudes and consequently RA value 
varies in a smaller range. 

To show the sensitivity of the proposed approach to 
the sensor location, Figures 11 and 12 show the GMM 
results for each sensor at LS2 and LS9, respectively. 
Figure 1 1 shows that all sensors mostly received tensile 
signals in LS2, which is consistent with the results 
shown in Figure 10(a). Figure 12 shows that the overall 
dominant mechanism in LS9 is shear; however, it can 
be observed that some sensors (i.e. SI, S2, and S6) 



indicate a higher probability of tensile mode. These are 
the sensors closer to the edges and bottom corners of 
the specimen, where the flexural cracks are dominant 
as shown in Figure 5. Therefore, sensor-based GMM 
may estimate the region of flexural and diagonal shear 
cracks given the sensor location. 

Likelihood ratio test 

To improve the classification results, a new criterion is 
proposed to define the cluster boundaries by keeping 
the data points with higher likelihood into the tensile or 
shear cluster and associate the rest of data to another 
cluster, namely, "mixed." With this aim, the likelihood 
ratio test is used as follows: 



^ = -21n 



like (X) for 1 st model (Tensile) 
like (X) for 2nd model (Shear) 

"^sSi/zj. — > Tensile 

i// r <"*"<i// 5 — > Mixed 
^P > if/ s — > Shear 



(12) 



where M* is the likelihood ratio and hke(X) is the final 
maximized likelihood in GMM algorithm. Now the 
boundary of clusters can be simply defined by certain 
values of M*, that is, ip T and \p s for tensile and shear, 
respectively. In this work, the data points in which the 
likelihood of being tensile is more than twice the likeli- 
hood of being shear are clustered in tensile class. 
Therefore, ij/ T = —2 In (2)= —1.4 is selected as a 
boundary of tensile class; likewise, the boundary for 
shear class is defined by data points in which likelihood 
of being shear is twice the likelihood of being tensile 
(i.e. ip s = — 21n(l/2) = 1.4. Figure 13 illustrates how 
the new criterion enhanced the classification by provid- 
ing an additional option for the operator to select a 
desired likelihood ratio. The blue diamonds (O) are 
associated to the data in which the likelihood of belong- 
ing to the tensile mode is twice the likelihood of belong- 
ing to the shear mode. The black dots (.) is called 
"mixed" cluster since it includes the data with less prob- 
ability of being associated to a specific clusters. The red 
circles (o) are the data with likelihood higher than twice 
the likelihood of being tensile. 

To track the classification results for all load steps, 
the number of AE activities associated with each cluster 
is shown in Figure 14. It can be observed that from load 
steps 2-A, the majority of AE signals were generated by 
nucleation of tensile cracks (solid lines in Figure 5). In 
load steps 5 and 6, the amount of shear cracks began to 
surpass tensile cracks; this is called transition stage. 
Finally, a growth of shear cracks was observed in load 
steps 7-9. Moreover, by comparing these results with 
the force-displacement hysteresis loops (Figure 4), the 
following observations can be made: the proposed algo- 
rithm shows large AE activity associated to shear mode 
at the LS7 in which the onset of yielding was observed; 
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Figure 10. GMM of feature vectors in (a) LS2 and (b) LS3 (dominance of tensile class), (c) LS5 and (d) LS6 (transition stage), and 
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GMM: Gaussian mixture modeling. 



then an increase in tensile signals was observed at LS8 
due to both nucleation of few additional flexural cracks 
at the edges of the specimen and mechanism of brittle 



fracture in compression at the bottom corners (Rocks, 
2012); in fact under compressive loads, brittle materials 
generally fail by the formation of tensile microcracks at 
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Figure I I. GMM of feature vectors at LS2 for each sensor: (a) Sensor 8, (b) Sensor 7, (c) Sensor 4, (d) Sensor 5, (e) Sensor 6, 
(f) Sensor I , (g) Sensor 2, and (h) Sensor 3. 
GMM: Gaussian mixture modeling; LS: load step. 
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Figure 1 2. GMM of feature vectors at LS9 for each sensor: (a) Sensor 8, (b) Sensor 7, (c) Sensor 4, (d) Sensor 5, (e) Sensor 6, 
(f) Sensor I , (g) Sensor 2, and (h) Sensor 3. 
GMM: Gaussian mixture modeling; LS: load step. 
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Figure 1 3. Data selection with higher probability in each 
cluster in LS6. 



indicated significant increases in the population of 
shear type signals. Table 1 summarizes the percentage 
of data in each cluster to the total data set in each load 
step and the associated drift ratio at loading centerline. 
Based on these observations, a SHM system could be 
designed for similar structures by defining a threshold 
on the percentage of shear mode signals. To find this 
threshold, more specimens should be tested; however, a 
percentage threshold of 60% for shear class seems rea- 
sonable in this study. It highlights yielding and ultimate 
strength at drift ratios of 0.62% and 1.3%, respectively. 
Therefore, as the percentage exceeds this threshold, an 
early alarm could be triggered as an indication of signif- 
icant damage. Thereafter, imminent failure should be 
prevented by applying an appropriate retrofitting sce- 
nario (Bracci et al., 1997; Paterson, 2001; Priestley and 
Seible, 1996). 



Table I . Percentage of AE activities for each group. 
Load step Drift ratio Tensile Shear 
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AE: acoustic emission. 
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Figure 14. Estimation of crack mode propagation by GMM 

algorithm. 

GMM: Gaussian mixture modeling. 



Conclusion 

In this article, a new approach based on a GMM was 
presented to perform crack mode classification in RC 
structures. To validate the proposed system, an experi- 
mental study was carried out on a full-scale RC shear 
wall subjected to a displacement controlled quasi-static 
reversed cyclic loading. The test specimen was instru- 
mented with eight AE sensors. During testing, two fea- 
tures were extracted from each AE signal detected, that 
is, RA value and AF. These features were used to 
populate a data set X; then the proposed GMM algo- 
rithm was used to partition the data set X in two clus- 
ters, tensile and shear. The GMM algorithm was 
capable to identify three stages during the test: (a) a 
dominance of tensile (flexural) cracks during the initial 
load steps, (b) a transition stage during the intermedi- 
ate load steps, and (c) a dominance of shear cracks dur- 
ing the final load steps. These results were validated by 
visual inspection. Moreover, a new criterion based on a 
likelihood ratio test for enhancing crack classification 
was proposed. Future work should investigate the 
effects of geometry, specimen ductility, loading rate, 
and sensor layout. 
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